Add Joins #57

JosephTLockwood · 2023-07-06T20:09:42Z

Adds left, right, inner, and outer joins. It also has some tiny bug fixes.

(cherry picked from commit 0cb0a58)

(cherry picked from commit 399777f)

(cherry picked from commit b291f05)

(cherry picked from commit 83dc405)

(cherry picked from commit 17ef1c5)

(cherry picked from commit 89a7acf)

(cherry picked from commit 7853852)

(cherry picked from commit 292cb45)

(cherry picked from commit 512498d)

JosephTLockwood · 2023-07-06T20:29:36Z

The more I think about the functionality of this the more I feel that it should have input similar to your filter method. Thoughts?

paul-griffith

It might be worth a larger refactor approaching this from a slightly different angle to ease testing and implementation.
What if the different join functions were regular Kotlin functions that accept the two datasets and the column args.
Then the primary unit testing effort can be focused on the plain Kotlin methods (with no need for extra Python scaffolding), and can use DSBuilder to construct test datasets more easily.
Once the actual method implementations are proven well, it's easy to write scripting-focused methods that only have to care about parsing the args and passing them to the inner functions - and since there's a lot of shared boilerplate, that can be tested as one piece as well.
So it's 4 distinct tests for the join functions, plus 1 test of Python argument parsing, instead of 4 tests with lots of overlap for argument handling. Plus, well structured, the argument handling could probably be repeated in each script function implementation, which avoids extra work if requirements for the function(s) change.

paul-griffith · 2023-07-06T20:59:37Z

common/src/main/kotlin/org/imdc/extensions/common/DatasetExtensions.kt

+        val column = parsedArgs.requirePyObject("columnIndex").toJava<Int>()
+        val column2 = parsedArgs.requirePyObject("columnIndex2").toJava<Int>()


Should be able to just do parsedArgs.requireInteger("columnIndex") to get a primitive int back; saves some work

paul-griffith · 2023-07-06T21:01:34Z

common/src/main/kotlin/org/imdc/extensions/common/DatasetExtensions.kt

+            val listToAppend = arrayOfNulls<Any?>(combinedColumnName.size)
+            var row2: Int? = null
+
+            dataset2.rowIndices.forEachIndexed { rowIndex, _ ->


Maybe try firstNonNullOfOrNull {} here to express what you're trying to do more succintly

paul-griffith · 2023-07-06T21:04:41Z

common/src/main/kotlin/org/imdc/extensions/common/DatasetExtensions.kt

+        names = ["dataset", "columnsToSplit"],
+        types = [Dataset::class, Array<Array<Int>>::class],
+    )
+    fun splitter(args: Array<PyObject>, keywords: Array<String>): Array<Dataset?> {


Returning as an array is a little weird - a plain list would be more typical for Python/Java/Ignition

(cherry picked from commit 0cb0a58)

(cherry picked from commit 399777f)

(cherry picked from commit b291f05)

(cherry picked from commit 83dc405)

(cherry picked from commit 17ef1c5)

(cherry picked from commit 89a7acf)

(cherry picked from commit 7853852)

# Conflicts: # common/build.gradle.kts # common/src/test/kotlin/org/imdc/extensions/common/DatasetExtensionsTests.kt

(cherry picked from commit 399777f)

(cherry picked from commit 7853852)

JosephTLockwood · 2023-07-19T21:27:51Z

I plan to revamp this. Currently, my thought is, it would be nice to have something like

system.dataset.joinOn(joinType="left",joinOn=lambda **kwargs d1, d2: d1['column1'] == d2['column2'] and d1['column1']>5

This could be a powerful tool. My current hang-up is passing two datasets into **kwargs.

paul-griffith · 2023-07-19T22:35:37Z

I like that proposed syntax. You might like Phil Turmel's Simulation Aids; particularly the crazy stuff he's doing with expressions: https://forum.inductiveautomation.com/t/automation-professionals-simulation-aids-v2/74934

This could be a powerful tool. My current hang-up is passing two datasets into **kwargs.

I don't think the Kotlin function should expect the Python function to accept kwargs at all. A join is effectively a function (leftDatasetRow, rightDatasetRow) -> Boolean.
So the Kotlin side should probably be providing a tuple where each item is a dictionary columnName: valueAtRow/Col, much like system.dataset.filter/system.dataset.map do.
And then Python function would be written something like:
lambda d1, d2: d1['column1'] == d2['column2'] and d1['column1']>5

JosephTLockwood · 2023-07-20T02:34:40Z

Thanks for pointing me back to that page. That was where I originally started my search. It appears 1 hour, and he added left join functionality 😆 https://forum.inductiveautomation.com/t/automation-professionals-simulation-aids-v2/74934/42 I'll give it a shot tomorrow. If it works, I'll leave it up to you to close this. I enjoy the learning experience of this and like how I have full customization and the fact that it currently works. However, I probably won't spend much time on it if this works, as I have a full platter and would only be able to work on it later.

JosephTLockwood · 2023-07-20T20:33:34Z

Okay, so this should be a working left join. It was a lot simpler than I thought. I still need to clean it up and put the join type as an option, but then it should be good to go after that (I believe).

JosephTLockwood · 2023-07-21T13:18:19Z

It should be good to go. I ended up going with column indexs instead of column names. I can change it if you want. I just though this was more elegant.

Example of Left Join:
utils.joiner(dataset, dataset2, 'left', lambda d1, d2: (d1[0] == d2[0]))

JosephTLockwood · 2023-07-28T14:08:20Z

I plan to make a change to this. I am not sure how I overlooked this, but for some reason, I had a break after the first match was found. This makes it not a left join.

paul-griffith · 2023-07-28T14:45:08Z

A work-in-progress refactor of what I was alluding to earlier with breaking things out to make unit testing a bit easier:
JosephTLockwood#2

JosephTLockwood added 9 commits July 6, 2023 11:56

Implement left join of two datasets with working test case.

e2d9eff

(cherry picked from commit 0cb0a58)

Add description for leftJoin

e5d2488

(cherry picked from commit 399777f)

Change type of columnIndex to Int

a2a4991

(cherry picked from commit b291f05)

Set temp value

881d584

(cherry picked from commit 83dc405)

Clean up

7c8d081

(cherry picked from commit 17ef1c5)

Clean up

9577aa9

(cherry picked from commit 89a7acf)

Add splitter of dataset

51f3c0f

(cherry picked from commit 7853852)

Update withClue

080eb6c

(cherry picked from commit 292cb45)

Get rid of illegal reflective access warning

f443427

(cherry picked from commit 512498d)

paul-griffith self-requested a review July 6, 2023 20:58

paul-griffith requested changes Jul 6, 2023

View reviewed changes

JosephTLockwood and others added 15 commits July 11, 2023 10:24

Merge branch 'IgnitionModuleDevelopmentCommunity:main' into left-join

9009d16

Get rid of illegal reflective access warning

a91c538

Remove unnecessary lambda

8199b0e

Update inductiveautomation/ignition Docker tag to v8.1.29

fdc341c

Implement left join of two datasets with working test case.

bdb8087

(cherry picked from commit 0cb0a58)

Add description for leftJoin

df77faf

(cherry picked from commit 399777f)

Change type of columnIndex to Int

8b64dbe

(cherry picked from commit b291f05)

Set temp value

9920015

(cherry picked from commit 83dc405)

Clean up

ea28eb3

(cherry picked from commit 17ef1c5)

Clean up

8d32fad

(cherry picked from commit 89a7acf)

Add splitter of dataset

9aa819a

(cherry picked from commit 7853852)

Merge remote-tracking branch 'origin/left-join' into left-join

6608026

# Conflicts: # common/build.gradle.kts # common/src/test/kotlin/org/imdc/extensions/common/DatasetExtensionsTests.kt

Add description for leftJoin

7ec8fa7

(cherry picked from commit 399777f)

Add splitter of dataset

edea280

(cherry picked from commit 7853852)

Merge remote-tracking branch 'origin/left-join' into left-join

1082b0d

Convert to PyObject

f98b789

JosephTLockwood added 2 commits July 20, 2023 16:33

Clean up

9ab8ad2

Add right, left, inner, outer to joiner function.

d06742f

		val column = parsedArgs.requirePyObject("columnIndex").toJava<Int>()
		val column2 = parsedArgs.requirePyObject("columnIndex2").toJava<Int>()

Add Joins #57

Are you sure you want to change the base?

Add Joins #57

Uh oh!

Conversation

JosephTLockwood commented Jul 6, 2023

Uh oh!

JosephTLockwood commented Jul 6, 2023

Uh oh!

paul-griffith left a comment

Choose a reason for hiding this comment

Uh oh!

paul-griffith Jul 6, 2023

Choose a reason for hiding this comment

Uh oh!

paul-griffith Jul 6, 2023

Choose a reason for hiding this comment

Uh oh!

paul-griffith Jul 6, 2023

Choose a reason for hiding this comment

Uh oh!

JosephTLockwood commented Jul 19, 2023

Uh oh!

paul-griffith commented Jul 19, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JosephTLockwood commented Jul 20, 2023

Uh oh!

JosephTLockwood commented Jul 20, 2023

Uh oh!

JosephTLockwood commented Jul 21, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JosephTLockwood commented Jul 28, 2023

Uh oh!

paul-griffith commented Jul 28, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

paul-griffith commented Jul 19, 2023 •

edited

Loading

JosephTLockwood commented Jul 21, 2023 •

edited

Loading